Overview

Dataset info

Number of variables22
Number of observations15762
Missing cells0 (0.0%)
Duplicate rows0 (0.0%)
Total size in memory2.6 MiB
Average record size in memory176.0 B

Variables types

Numeric19
Categorical2
Boolean1
Date0
URL0
Text (Unique)0
Rejected0
Unsupported0

Warnings

date has a high cardinality: 369 distinct values Warning
sqft_basement has a high cardinality: 283 distinct values Warning
view has 14241 (90.4%) zeros Zeros
yr_renovated has 15111 (95.9%) zeros Zeros

Variables

bathrooms
Numeric

Distinct count27
Unique (%)0.2%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean2.120796853
Minimum0.5
Maximum8
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.5
5-th percentile1
Q11.75
Median2.25
Q32.5
95-th percentile3.5
Maximum8
Range7.5
Interquartile range0.75

Descriptive statistics

Standard deviation0.7667716477
Coef of variation0.3615488426
Kurtosis1.343529285
Mean2.120796853
MAD0.6122442832
Skewness0.5179102509
Sum33428
Variance0.5879387597
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=27)
Histogram
Histogram with variable size bins (bins=[0.5 0.625 0.875 1.125 1.375 ... 3.625 4.125 4.625 5.625 8. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2.5 4004 25.4%
 
1 2761 17.5%
 
1.75 2226 14.1%
 
2.25 1487 9.4%
 
2 1395 8.9%
 
1.5 1060 6.7%
 
2.75 853 5.4%
 
3 543 3.4%
 
3.5 543 3.4%
 
3.25 428 2.7%
 
Other values (17) 462 2.9%
 

Minimum 5 values

ValueCountFrequency (%) 
0.5 3 < 0.1%
 
0.75 50 0.3%
 
1 2761 17.5%
 
1.25 6 < 0.1%
 
1.5 1060 6.7%
 

Maximum 5 values

ValueCountFrequency (%) 
8 2 < 0.1%
 
7.75 1 < 0.1%
 
7.5 1 < 0.1%
 
6.75 1 < 0.1%
 
6 5 < 0.1%
 

bedrooms
Numeric

Distinct count12
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean3.378949372
Minimum1
Maximum33
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile2
Q13
Median3
Q34
95-th percentile5
Maximum33
Range32
Interquartile range1

Descriptive statistics

Standard deviation0.9353010799
Coef of variation0.2768023361
Kurtosis65.29214176
Mean3.378949372
MAD0.7353872581
Skewness2.5301133
Sum53259
Variance0.87478811
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=12)
Histogram
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 4.5 5.5 6.5 7.5 10.5 33. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3 7120 45.2%
 
4 5079 32.2%
 
2 2003 12.7%
 
5 1183 7.5%
 
6 192 1.2%
 
1 141 0.9%
 
7 23 0.1%
 
8 10 0.1%
 
9 6 < 0.1%
 
10 3 < 0.1%
 
Other values (2) 2 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1 141 0.9%
 
2 2003 12.7%
 
3 7120 45.2%
 
4 5079 32.2%
 
5 1183 7.5%
 

Maximum 5 values

ValueCountFrequency (%) 
33 1 < 0.1%
 
11 1 < 0.1%
 
10 3 < 0.1%
 
9 6 < 0.1%
 
8 10 0.1%
 

condition
Numeric

Distinct count5
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean3.410861566
Minimum1
Maximum5
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile3
Q13
Median3
Q34
95-th percentile5
Maximum5
Range4
Interquartile range1

Descriptive statistics

Standard deviation0.6519608965
Coef of variation0.1911425849
Kurtosis0.4942516587
Mean3.410861566
MAD0.562117155
Skewness1.0385374
Sum53762
Variance0.4250530106
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=5)
ValueCountFrequency (%) 
3 10221 64.8%
 
4 4137 26.2%
 
5 1254 8.0%
 
2 131 0.8%
 
1 19 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1 19 0.1%
 
2 131 0.8%
 
3 10221 64.8%
 
4 4137 26.2%
 
5 1254 8.0%
 

Maximum 5 values

ValueCountFrequency (%) 
5 1254 8.0%
 
4 4137 26.2%
 
3 10221 64.8%
 
2 131 0.8%
 
1 19 0.1%
 

date
Categorical

Distinct count369
Unique (%)2.3%
Missing (%)0.0%
Missing (n)0
6/25/2014
 
103
6/23/2014
 
102
7/14/2014
 
93
Other values (366)
15464
ValueCountFrequency (%) 
6/25/2014 103 0.7%
 
6/23/2014 102 0.6%
 
7/14/2014 93 0.6%
 
7/8/2014 93 0.6%
 
4/28/2015 93 0.6%
 
10/28/2014 92 0.6%
 
5/20/2014 89 0.6%
 
8/20/2014 89 0.6%
 
4/27/2015 89 0.6%
 
4/22/2015 88 0.6%
 
Other values (359) 14831 94.1%
 
Max length10
Mean length8.924375079
Min length8
Contains charsFalse
Contains digitsTrue
Contains spacesFalse
Contains non-wordsTrue

df_index
Numeric

Distinct count15762
Unique (%)100.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean10782.56941
Minimum1
Maximum21596
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1064.05
Q15398.25
Median10782.5
Q316150.75
95-th percentile20476.95
Maximum21596
Range21595
Interquartile range10752.5

Descriptive statistics

Standard deviation6218.086579
Coef of variation0.5766794855
Kurtosis-1.194688287
Mean10782.56941
MAD5381.085712
Skewness-0.001794292219
Sum169954859
Variance38664600.7
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[1.0000e+00 2.1596e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 1 < 0.1%
 
6806 1 < 0.1%
 
14994 1 < 0.1%
 
8849 1 < 0.1%
 
21135 1 < 0.1%
 
19084 1 < 0.1%
 
4743 1 < 0.1%
 
645 1 < 0.1%
 
2692 1 < 0.1%
 
14978 1 < 0.1%
 
Other values (15752) 15752 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
1 1 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 
5 1 < 0.1%
 
6 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
21596 1 < 0.1%
 
21594 1 < 0.1%
 
21593 1 < 0.1%
 
21592 1 < 0.1%
 
21591 1 < 0.1%
 

floors
Numeric

Distinct count6
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1.495146555
Minimum1
Maximum3.5
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q11
Median1.5
Q32
95-th percentile2
Maximum3.5
Range2.5
Interquartile range1

Descriptive statistics

Standard deviation0.5393516866
Coef of variation0.360734996
Kurtosis-0.5043981528
Mean1.495146555
MAD0.4886124551
Skewness0.6056036105
Sum23566.5
Variance0.2909002418
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%) 
1 7777 49.3%
 
2 6049 38.4%
 
1.5 1374 8.7%
 
3 439 2.8%
 
2.5 117 0.7%
 
3.5 6 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1 7777 49.3%
 
1.5 1374 8.7%
 
2 6049 38.4%
 
2.5 117 0.7%
 
3 439 2.8%
 

Maximum 5 values

ValueCountFrequency (%) 
3.5 6 < 0.1%
 
3 439 2.8%
 
2.5 117 0.7%
 
2 6049 38.4%
 
1.5 1374 8.7%
 

grade
Numeric

Distinct count11
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean7.663748255
Minimum3
Maximum13
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum3
5-th percentile6
Q17
Median7
Q38
95-th percentile10
Maximum13
Range10
Interquartile range1

Descriptive statistics

Standard deviation1.172238401
Coef of variation0.1529588867
Kurtosis1.151892098
Mean7.663748255
MAD0.9280617225
Skewness0.8035754559
Sum120796
Variance1.374142869
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=11)
Histogram
Histogram with variable size bins (bins=[ 3. 4.5 5.5 6.5 7.5 ... 9.5 10.5 11.5 12.5 13. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
7 6539 41.5%
 
8 4438 28.2%
 
9 1920 12.2%
 
6 1482 9.4%
 
10 832 5.3%
 
11 290 1.8%
 
5 167 1.1%
 
12 66 0.4%
 
4 16 0.1%
 
13 11 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
3 1 < 0.1%
 
4 16 0.1%
 
5 167 1.1%
 
6 1482 9.4%
 
7 6539 41.5%
 

Maximum 5 values

ValueCountFrequency (%) 
13 11 0.1%
 
12 66 0.4%
 
11 290 1.8%
 
10 832 5.3%
 
9 1920 12.2%
 

id
Numeric

Distinct count15676
Unique (%)99.5%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean4593363852
Minimum1000102
Maximum9895000040
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1000102
5-th percentile522049616.6
Q12125159330
Median3905081210
Q37334501432
95-th percentile9297301504
Maximum9895000040
Range9893999938
Interquartile range5209342102

Descriptive statistics

Standard deviation2876078445
Coef of variation0.6261377363
Kurtosis-1.265290014
Mean4593363852
MAD2544792076
Skewness0.2348705489
Sum7.240060104e+13
Variance8.27182722e+18
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[1.00010200e+06 1.15007750e+07 1.15155350e+07 1.60002060e+07 1.60004900e+07 ... 9.83420024e+09 9.83420142e+09 9.83930042e+09 9.83930111e+09 9.89500004e+09], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
6632900574 2 < 0.1%
 
9353300600 2 < 0.1%
 
5101402435 2 < 0.1%
 
722039087 2 < 0.1%
 
1523049207 2 < 0.1%
 
9407110710 2 < 0.1%
 
3271300955 2 < 0.1%
 
9834200885 2 < 0.1%
 
1139600270 2 < 0.1%
 
6300000226 2 < 0.1%
 
Other values (15666) 15742 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
1000102 1 < 0.1%
 
1200021 1 < 0.1%
 
2800031 1 < 0.1%
 
3600057 1 < 0.1%
 
5200087 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
9895000040 1 < 0.1%
 
9842300485 1 < 0.1%
 
9842300095 1 < 0.1%
 
9842300036 1 < 0.1%
 
9839301165 1 < 0.1%
 

lat
Numeric

Distinct count4747
Unique (%)30.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean47.55917675
Minimum47.1559
Maximum47.7776
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum47.1559
5-th percentile47.3099
Q147.4692
Median47.571
Q347.6774
95-th percentile47.7492
Maximum47.7776
Range0.6217
Interquartile range0.2082

Descriptive statistics

Standard deviation0.1386290362
Coef of variation0.002914874597
Kurtosis-0.6862934938
Mean47.55917675
MAD0.1148537102
Skewness-0.4785235348
Sum749627.7439
Variance0.01921800966
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[47.1559 47.18955 47.2161 47.2571 47.26495 ... 47.66805 47.69935 47.74675 47.75955 47.7776 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
47.6846 14 0.1%
 
47.6955 14 0.1%
 
47.6624 13 0.1%
 
47.6711 13 0.1%
 
47.6647 13 0.1%
 
47.5322 13 0.1%
 
47.6914 12 0.1%
 
47.6368 12 0.1%
 
47.5491 12 0.1%
 
47.6684 12 0.1%
 
Other values (4737) 15634 99.2%
 

Minimum 5 values

ValueCountFrequency (%) 
47.1559 1 < 0.1%
 
47.1593 1 < 0.1%
 
47.1647 1 < 0.1%
 
47.1764 1 < 0.1%
 
47.1775 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
47.7776 3 < 0.1%
 
47.7775 2 < 0.1%
 
47.7772 3 < 0.1%
 
47.7771 1 < 0.1%
 
47.777 1 < 0.1%
 

long
Numeric

Distinct count728
Unique (%)4.6%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean-122.2135195
Minimum-122.519
Maximum-121.315
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum-122.519
5-th percentile-122.387
Q1-122.328
Median-122.229
Q3-122.124
95-th percentile-121.979
Maximum-121.315
Range1.204
Interquartile range0.204

Descriptive statistics

Standard deviation0.1407064419
Coef of variation-0.001151316503
Kurtosis0.9879899801
Mean-122.2135195
MAD0.1150718315
Skewness0.8718665429
Sum-1926329.495
Variance0.0197983028
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[-122.519 -122.466 -122.4155 -122.4105 -122.4005 ... -121.7685 -121.7335 -121.6945 -121.411 -121.315 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-122.29 86 0.5%
 
-122.365 78 0.5%
 
-122.3 77 0.5%
 
-122.288 76 0.5%
 
-122.363 74 0.5%
 
-122.357 73 0.5%
 
-122.172 72 0.5%
 
-122.362 71 0.5%
 
-122.306 71 0.5%
 
-122.304 71 0.5%
 
Other values (718) 15013 95.2%
 

Minimum 5 values

ValueCountFrequency (%) 
-122.519 1 < 0.1%
 
-122.515 1 < 0.1%
 
-122.514 1 < 0.1%
 
-122.512 1 < 0.1%
 
-122.511 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
-121.315 1 < 0.1%
 
-121.321 1 < 0.1%
 
-121.325 1 < 0.1%
 
-121.352 2 < 0.1%
 
-121.359 1 < 0.1%
 

price
Numeric

Distinct count3034
Unique (%)19.2%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean541317.1757
Minimum82000
Maximum7700000
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum82000
5-th percentile212000
Q1321000
Median450000
Q3644875
95-th percentile1160000
Maximum7700000
Range7618000
Interquartile range323875

Descriptive statistics

Standard deviation372225.8387
Coef of variation0.6876298322
Kurtosis38.08318766
Mean541317.1757
MAD234706.1635
Skewness4.226727018
Sum8532241324
Variance1.38552075e+11
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 82000. 109250. 149950. 150500. 159537.5 ... 1705000. 2005000. 2590000. 3825000. 7700000. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
350000 129 0.8%
 
450000 120 0.8%
 
500000 115 0.7%
 
550000 113 0.7%
 
425000 111 0.7%
 
525000 104 0.7%
 
400000 100 0.6%
 
325000 98 0.6%
 
375000 97 0.6%
 
300000 96 0.6%
 
Other values (3024) 14679 93.1%
 

Minimum 5 values

ValueCountFrequency (%) 
82000 1 < 0.1%
 
82500 1 < 0.1%
 
83000 1 < 0.1%
 
84000 1 < 0.1%
 
85000 2 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
7700000 1 < 0.1%
 
7060000 1 < 0.1%
 
6890000 1 < 0.1%
 
5350000 1 < 0.1%
 
5110000 1 < 0.1%
 

sqft_above
Numeric

Distinct count835
Unique (%)5.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1792.775473
Minimum370
Maximum9410
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum370
5-th percentile850
Q11200
Median1570
Q32220
95-th percentile3400
Maximum9410
Range9040
Interquartile range1020

Descriptive statistics

Standard deviation828.4035021
Coef of variation0.4620787794
Kurtosis3.437242825
Mean1792.775473
MAD641.3179445
Skewness1.441481453
Sum28257727
Variance686252.3623
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 370. 555. 665. 695. 762.5 ... 4505. 4775. 5485. 6680. 9410. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1300 152 1.0%
 
1200 144 0.9%
 
1010 143 0.9%
 
1220 137 0.9%
 
1060 134 0.9%
 
1180 131 0.8%
 
1140 131 0.8%
 
1320 130 0.8%
 
1250 129 0.8%
 
1080 129 0.8%
 
Other values (825) 14402 91.4%
 

Minimum 5 values

ValueCountFrequency (%) 
370 1 < 0.1%
 
380 1 < 0.1%
 
390 1 < 0.1%
 
410 1 < 0.1%
 
420 2 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
9410 1 < 0.1%
 
8860 1 < 0.1%
 
8570 1 < 0.1%
 
7880 1 < 0.1%
 
7850 1 < 0.1%
 

sqft_basement
Categorical

Distinct count283
Unique (%)1.8%
Missing (%)0.0%
Missing (n)0
0.0
9362
?
 
333
600.0
 
155
Other values (280)
5912
ValueCountFrequency (%) 
0.0 9362 59.4%
 
? 333 2.1%
 
600.0 155 1.0%
 
500.0 151 1.0%
 
700.0 148 0.9%
 
400.0 144 0.9%
 
800.0 139 0.9%
 
900.0 101 0.6%
 
1000.0 99 0.6%
 
300.0 98 0.6%
 
Other values (273) 5032 31.9%
 
Max length6
Mean length3.815505646
Min length1
Contains charsFalse
Contains digitsTrue
Contains spacesFalse
Contains non-wordsTrue

sqft_living
Numeric

Distinct count912
Unique (%)5.8%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean2084.512372
Minimum370
Maximum13540
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum370
5-th percentile940
Q11430
Median1920
Q32550
95-th percentile3760
Maximum13540
Range13170
Interquartile range1120

Descriptive statistics

Standard deviation918.6176865
Coef of variation0.4406870878
Kurtosis5.747514663
Mean2084.512372
MAD697.627182
Skewness1.500560911
Sum32856084
Variance843858.4539
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 370. 505. 655. 695. 824. ... 4505. 4876.5 5900. 8005. 13540. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1820 102 0.6%
 
1440 100 0.6%
 
1400 97 0.6%
 
1300 95 0.6%
 
1320 94 0.6%
 
1720 91 0.6%
 
1540 91 0.6%
 
1510 89 0.6%
 
1800 89 0.6%
 
1830 88 0.6%
 
Other values (902) 14826 94.1%
 

Minimum 5 values

ValueCountFrequency (%) 
370 1 < 0.1%
 
380 1 < 0.1%
 
390 1 < 0.1%
 
410 1 < 0.1%
 
420 2 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
13540 1 < 0.1%
 
12050 1 < 0.1%
 
10040 1 < 0.1%
 
9890 1 < 0.1%
 
9640 1 < 0.1%
 

sqft_living15
Numeric

Distinct count694
Unique (%)4.4%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1990.219579
Minimum399
Maximum6210
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum399
5-th percentile1140
Q11490
Median1846
Q32370
95-th percentile3300
Maximum6210
Range5811
Interquartile range880

Descriptive statistics

Standard deviation684.1424953
Coef of variation0.3437522686
Kurtosis1.631805886
Mean1990.219579
MAD535.3537565
Skewness1.102719855
Sum31369841
Variance468050.9538
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 399. 680. 815. 956. 994. ... 3935. 4115. 4765. 5095. 6210.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1560 150 1.0%
 
1540 146 0.9%
 
1440 142 0.9%
 
1800 130 0.8%
 
1500 125 0.8%
 
1580 125 0.8%
 
1460 123 0.8%
 
1760 123 0.8%
 
1640 122 0.8%
 
1680 120 0.8%
 
Other values (684) 14456 91.7%
 

Minimum 5 values

ValueCountFrequency (%) 
399 1 < 0.1%
 
460 1 < 0.1%
 
620 2 < 0.1%
 
670 1 < 0.1%
 
690 2 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
6210 1 < 0.1%
 
6110 1 < 0.1%
 
5790 5 < 0.1%
 
5600 1 < 0.1%
 
5500 1 < 0.1%
 

sqft_lot
Numeric

Distinct count7927
Unique (%)50.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean15280.82141
Minimum520
Maximum1651359
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum520
5-th percentile1827
Q15048.5
Median7602
Q310720
95-th percentile43709.5
Maximum1651359
Range1650839
Interquartile range5671.5

Descriptive statistics

Standard deviation41822.88332
Coef of variation2.736952564
Kurtosis309.6894817
Mean15280.82141
MAD14107.33686
Skewness13.40978595
Sum240856307
Variance1749153569
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[5.200000e+02 6.785000e+02 8.635000e+02 1.147500e+03 1.351500e+03 ... 2.178670e+05 2.254330e+05 2.862680e+05 4.408265e+05 1.651359e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5000 277 1.8%
 
6000 204 1.3%
 
4000 177 1.1%
 
7200 151 1.0%
 
4800 89 0.6%
 
4500 88 0.6%
 
8400 82 0.5%
 
7500 80 0.5%
 
9600 79 0.5%
 
3600 75 0.5%
 
Other values (7917) 14460 91.7%
 

Minimum 5 values

ValueCountFrequency (%) 
520 1 < 0.1%
 
572 1 < 0.1%
 
609 1 < 0.1%
 
649 2 < 0.1%
 
676 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
1651359 1 < 0.1%
 
1164794 1 < 0.1%
 
1024068 1 < 0.1%
 
982998 1 < 0.1%
 
982278 1 < 0.1%
 

sqft_lot15
Numeric

Distinct count7126
Unique (%)45.2%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean12900.41556
Minimum659
Maximum871200
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum659
5-th percentile2044.2
Q15100
Median7620
Q310107.5
95-th percentile37563
Maximum871200
Range870541
Interquartile range5007.5

Descriptive statistics

Standard deviation27977.23006
Coef of variation2.168707662
Kurtosis169.8269317
Mean12900.41556
MAD10315.5775
Skewness10.00386182
Sum203336350
Variance782725401.8
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[6.590000e+02 9.145000e+02 1.057000e+03 1.169500e+03 1.428000e+03 ... 2.177995e+05 2.180110e+05 2.245520e+05 4.364705e+05 8.712000e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5000 313 2.0%
 
4000 262 1.7%
 
6000 201 1.3%
 
7200 153 1.0%
 
7500 102 0.6%
 
4800 99 0.6%
 
4500 89 0.6%
 
3600 86 0.5%
 
5100 84 0.5%
 
8400 78 0.5%
 
Other values (7116) 14295 90.7%
 

Minimum 5 values

ValueCountFrequency (%) 
659 1 < 0.1%
 
660 1 < 0.1%
 
750 4 < 0.1%
 
755 1 < 0.1%
 
757 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
871200 1 < 0.1%
 
858132 1 < 0.1%
 
560617 1 < 0.1%
 
438213 1 < 0.1%
 
434728 1 < 0.1%
 

view
Numeric

Distinct count5
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.2299835046
Minimum0
Maximum4
Zeros (%)90.4%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile2
Maximum4
Range4
Interquartile range0

Descriptive statistics

Standard deviation0.7613244113
Coef of variation3.31034355
Kurtosis11.32979091
Mean0.2299835046
MAD0.4155811559
Skewness3.45271416
Sum3625
Variance0.5796148592
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=5)
ValueCountFrequency (%) 
0 14241 90.4%
 
2 688 4.4%
 
3 348 2.2%
 
1 245 1.6%
 
4 240 1.5%
 

Minimum 5 values

ValueCountFrequency (%) 
0 14241 90.4%
 
1 245 1.6%
 
2 688 4.4%
 
3 348 2.2%
 
4 240 1.5%
 

Maximum 5 values

ValueCountFrequency (%) 
4 240 1.5%
 
3 348 2.2%
 
2 688 4.4%
 
1 245 1.6%
 
0 14241 90.4%
 

waterfront
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
15642
1
 
120
ValueCountFrequency (%) 
0 15642 99.2%
 
1 120 0.8%
 

yr_built
Numeric

Distinct count116
Unique (%)0.7%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1971.111217
Minimum1900
Maximum2015
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1900
5-th percentile1915
Q11952
Median1975
Q31997
95-th percentile2011
Maximum2015
Range115
Interquartile range45

Descriptive statistics

Standard deviation29.33682327
Coef of variation0.0148833932
Kurtosis-0.6465086991
Mean1971.111217
MAD24.52195997
Skewness-0.479157912
Sum31068655
Variance860.6491997
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[1900. 1900.5 1904.5 1909.5 1910.5 ... 2009.5 2012.5 2013.5 2014.5 2015. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2014 400 2.5%
 
2006 335 2.1%
 
2005 327 2.1%
 
2007 308 2.0%
 
2004 304 1.9%
 
2003 302 1.9%
 
1977 301 1.9%
 
1978 289 1.8%
 
1968 286 1.8%
 
2008 264 1.7%
 
Other values (106) 12646 80.2%
 

Minimum 5 values

ValueCountFrequency (%) 
1900 59 0.4%
 
1901 22 0.1%
 
1902 18 0.1%
 
1903 28 0.2%
 
1904 38 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
2015 25 0.2%
 
2014 400 2.5%
 
2013 153 1.0%
 
2012 121 0.8%
 
2011 94 0.6%
 

yr_renovated
Numeric

Distinct count70
Unique (%)0.4%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean82.44023601
Minimum0
Maximum2015
Zeros (%)95.9%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum2015
Range2015
Interquartile range0

Descriptive statistics

Standard deviation397.2126256
Coef of variation4.818188846
Kurtosis19.26758165
Mean82.44023601
MAD158.070601
Skewness4.611241041
Sum1299423
Variance157777.8699
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 0. 967. 1937. 1954.5 1967.5 1982.5 1999.5 2010.5 2012.5 2015. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 15111 95.9%
 
2014 64 0.4%
 
2013 29 0.2%
 
2005 27 0.2%
 
2000 25 0.2%
 
2007 24 0.2%
 
2003 24 0.2%
 
1990 22 0.1%
 
2009 19 0.1%
 
2004 18 0.1%
 
Other values (60) 399 2.5%
 

Minimum 5 values

ValueCountFrequency (%) 
0 15111 95.9%
 
1934 1 < 0.1%
 
1940 2 < 0.1%
 
1944 1 < 0.1%
 
1945 2 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
2015 13 0.1%
 
2014 64 0.4%
 
2013 29 0.2%
 
2012 7 < 0.1%
 
2011 7 < 0.1%
 

zipcode
Numeric

Distinct count70
Unique (%)0.4%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean98077.55824
Minimum98001
Maximum98199
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum98001
5-th percentile98004
Q198033
Median98065
Q398117
95-th percentile98177
Maximum98199
Range198
Interquartile range84

Descriptive statistics

Standard deviation53.41490569
Coef of variation0.0005446190408
Kurtosis-0.8483166889
Mean98077.55824
MAD46.61151031
Skewness0.4133118275
Sum1545898473
Variance2853.15215
Memory size123.2 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[98001. 98001.5 98002.5 98004.5 98005.5 ... 98151.5 98183. 98193. 98198.5 98199. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
98038 437 2.8%
 
98103 425 2.7%
 
98052 416 2.6%
 
98042 408 2.6%
 
98115 407 2.6%
 
98117 403 2.6%
 
98034 396 2.5%
 
98118 368 2.3%
 
98023 368 2.3%
 
98006 361 2.3%
 
Other values (60) 11773 74.7%
 

Minimum 5 values

ValueCountFrequency (%) 
98001 255 1.6%
 
98002 149 0.9%
 
98003 217 1.4%
 
98004 231 1.5%
 
98005 129 0.8%
 

Maximum 5 values

ValueCountFrequency (%) 
98199 221 1.4%
 
98198 204 1.3%
 
98188 97 0.6%
 
98178 189 1.2%
 
98177 187 1.2%
 

Correlations

Missing values

Sample

First rows

bathroomsbedroomsconditiondatedf_indexfloorsgradeidlatlongpricesqft_abovesqft_basementsqft_livingsqft_living15sqft_lotsqft_lot15viewwaterfrontyr_builtyr_renovatedzipcode
02.253312/9/201412.07641410019247.7210-122.319538000.02170400.025701690724276390.00.019511991.098125
13.004512/9/201431.07248720087547.5208-122.393604000.01050910.019601360500050000.00.019650.098136
22.00332/18/201541.08195440051047.6168-122.045510000.016800.016801800808075030.00.019870.098074
34.50435/12/201451.011723755031047.6561-122.0051230000.038901530.0542047601019301019300.00.020010.098053
42.25336/27/201462.07132140006047.3097-122.327257500.01715?17152238681968190.00.019950.098003
51.00334/15/201581.07241460012647.5123-122.337229500.01050730.017801780747081130.00.019600.098146
62.50333/12/201592.07379350016047.3684-122.031323000.018900.018902390656075700.00.020030.098038
71.00245/27/2014111.07921290026047.6900-122.292468000.0860300.011601330600060000.00.019420.098115
81.753410/7/2014131.07605465007047.6127-122.045400000.013700.0137013709680102080.00.019770.098074
92.00533/12/2015141.57117500057047.6700-122.394530000.018100.018101360485048500.00.019000.098107

Last rows

bathroomsbedroomsconditiondatedf_indexfloorsgradeidlatlongpricesqft_abovesqft_basementsqft_livingsqft_living15sqft_lotsqft_lot15viewwaterfrontyr_builtyr_renovatedzipcode
157522.75538/13/2014215802.09750280010047.4822-122.131679950.036000.036003550943794210.00.020140.098059
157533.755310/15/2014215842.01124900020547.6321-122.2001540000.044700.044702780808889640.00.020080.098004
157542.50334/7/2015215853.08510040380647.6963-122.318467000.014250.014251285117912530.00.020080.098125
157552.00331/26/2015215883.08983420136747.5699-122.288429000.014900.014901400112612300.00.020140.098144
157563.50433/26/2015215902.09793600042947.5537-122.3981010000.02600910.035102050720062000.00.020090.098136
157572.50332/19/2015215912.08299780002147.5773-122.409475000.01180130.013101330129412650.00.020080.098116
157582.50335/21/2014215923.0826300001847.6993-122.346360000.015300.015301530113115090.00.020090.098103
157592.50432/23/2015215932.08660006012047.5107-122.362400000.023100.023101830581372000.00.020140.098146
157600.75236/23/2014215942.07152330014147.5944-122.299402101.010200.010201020135020070.00.020090.098144
157610.752310/15/2014215962.07152330015747.5941-122.299325000.010200.010201020107613570.00.020080.098144